Dialogue management for multimodal user registration
Abstract
User registration refers to associating certain personal information with a user. It is widely used in hospitals, hotels and conferences. In this paper, we propose an approach to interactive user registration that combines face recognition, speech recognition and speech synthesis through an efficient dialogue manager. To minimize the user’s effort, we employ a new dialogue management model based on a finite state automaton (FSA), which uses a Bayesian network to fuse the user’s information from multiple channels (e.g., face image, speech, records stored in a pre-constructed database) and reliably estimate the confidence in the user’s identity. Instead of using fixed weights, the FSA adjusts its weights dynamically by integrating partial information from multiple information sources. This is achieved by maximizing an objective function to determine an optimal action at each succeeding state according to the current confidence and information cues. Thus the transitions between states can follow the shortest path from the initial state to the goal state. We have developed a multimodal user registration system to demonstrate the feasibility of the proposed approach.
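The fusion-then-decide loop described in the abstract can be illustrated with a small sketch. This is not the authors' model: the abstract does not specify the Bayesian network or the objective function, so the sketch assumes the simplest possible network (channels conditionally independent given the identity, i.e. naive Bayes) and a hypothetical expected-utility objective. The function names, the action names, and every number in `ACTION_UTILITIES` are invented for illustration.

```python
from math import prod

# Hypothetical utilities per action: (reward if the identity hypothesis is
# right, penalty if it is wrong). These values are assumptions, not from the paper.
ACTION_UTILITIES = {
    "accept": (1.0, -5.0),    # register immediately; costly if wrong
    "confirm": (0.6, -0.5),   # ask a yes/no confirmation question
    "ask_name": (0.2, 0.0),   # fall back to asking for the name
}


def fuse_posterior(prior, channel_likelihoods):
    """Fuse channels into a posterior over identities.

    prior: dict identity -> P(identity)
    channel_likelihoods: list of dicts identity -> P(observation | identity),
        one dict per channel (face, speech, ...). Channels are assumed
        conditionally independent given the identity.
    """
    unnormalized = {
        uid: p * prod(ch[uid] for ch in channel_likelihoods)
        for uid, p in prior.items()
    }
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("all identity hypotheses have zero probability")
    return {uid: v / total for uid, v in unnormalized.items()}


def choose_action(confidence):
    """Pick the action that maximizes expected utility at this confidence."""
    def expected_utility(action):
        reward, penalty = ACTION_UTILITIES[action]
        return confidence * reward + (1.0 - confidence) * penalty
    return max(ACTION_UTILITIES, key=expected_utility)


def next_step(prior, channel_likelihoods):
    """Return (best identity, its confidence, chosen action)."""
    posterior = fuse_posterior(prior, channel_likelihoods)
    best = max(posterior, key=posterior.get)
    return best, posterior[best], choose_action(posterior[best])


if __name__ == "__main__":
    prior = {"alice": 0.5, "bob": 0.5}
    face = {"alice": 0.8, "bob": 0.2}
    speech = {"alice": 0.9, "bob": 0.1}
    print(next_step(prior, [face]))          # face evidence only
    print(next_step(prior, [face, speech]))  # face and speech combined
```

With these assumed utilities, weak evidence leads to asking for the name, moderate evidence to a confirmation question, and strong combined evidence to direct acceptance, which is the sense in which additional channels shorten the path to the goal state.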
Similar resources
Towards Multimodal Dialogue Management
Effective dialogue management is a key issue in speech-based interfaces to information systems since it can ensure a cooperative interaction with the user. Cooperativeness requires techniques which allow the user to efficiently access information and also techniques which compensate for limitations in system knowledge and speech technology. The paper describes management techniques devel...
flexdiam - Flexible Dialogue Management for Incremental Interaction with Virtual Agents (Demo Paper)
We present a demonstration system for incremental spoken human–machine dialogue for task-centric domains that includes a controller for verbal and nonverbal behavior for virtual agents. The dialogue management components can handle uncertainty in input and resolve it interactively with high responsivity, and state tracking is aware of momentary events such as interruptions by the user. Aside fr...
flexdiam – Flexible dialogue management for incremental interaction with virtual agents
We present a demonstration system for incremental spoken human–machine dialogue for task-centric domains that includes a controller for verbal and nonverbal behavior for virtual agents. The dialogue management components can handle uncertainty in input and resolve it interactively with high responsivity, and state tracking is aware of momentary events such as interruptions by the user. Aside fr...
On the Role of the NIMITEK Corpus in Developing an Emotion Adaptive Spoken Dialogue System
This paper reports on the creation of the multimodal NIMITEK corpus of affected behavior in human-machine interaction and its role in the development of the NIMITEK prototype system. The NIMITEK prototype system is a spoken dialogue system for supporting users while they solve problems in a graphics system. The central feature of the system is adaptive dialogue management. The system dynamicall...
A Pattern Language for Dialogue Management
Modeling human-computer interactions as dialog, while originating in voice user interfaces, is becoming increasingly important for multimodal systems. Different approaches to formalizing and managing dialogues exist, each with its specific strengths and weaknesses. In this paper, we present existing dialogue management techniques as patterns to give a basis for decision support when develo...